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1. Introduction 



In this report* we discuss relational programming , i.e. a 
style of programming in which entire relations are manipulated 
rather than individual data. This is analogous to functional 
programming [1]/ wherein entire functions are the values manipu- 
lated by the operators. We will see that relational programming 
subsumes functional programming because every function is also a 
relation. It is appropriate at this point to discuss why we have 
chosen to investigate relational programming. The reader can 
find a shorter introduction to relational programming in [12]. 

As we have noted, relational programming subsumes functional 
programming; hence, anything that can be done with functional 
programming can be done with relational programming. Further- 
more, relational programming has many of the advantages of func- 
tional programming: for instance, the ability to derive and 
manipulate programs by algebraic manipulation. A well developed 
algebra of relations dates back to Boole's original work and has 
been extensively studied since then. Although relations are more 
general than functions, their laws are often simpler. For 
instance, (fg)~^ = g~^f~^ is true for all relations, but true 
only for functions that are one-to-one. Also, relational pro- 
gramming more directly supports non-linear data structures, such 
as trees and graphs, than does functional programming. In rela- 
tional programming the basic data values are themselves rela- 
tions, whereas in functional programming there is a separate 
class of objects (lists) used for data structures. One final 
reason for investigating relational programming is that it pro- 
vides a possible paradigm for utilizing associative and active 
memories. As a teaser for what is to come, we present the fol- 
lowing example of a relational program. We will take a text T, 
represented as an array of words (i.e., T:i is the i-th word), 
and generate a frequency table F so that F;w is the number^of 
occurences of word w in T. Now we will see (section 4) that T:w 
is the set of all indices of the word w. If we let #:C be the 
cardinality of a^class, then the number of indices^ (occurences) 
of w is just D:(T:w). Therefore we can write F = #T (Section 7). 



2. Classes and Relations 



2. 1 basic concepts 



Our relational calculus will deal with three sorts of 
things: individuals, classes and relations. These can best be 
illustrated by example. If 'x' is the name of an individual and 
'C is the name of a class, then ' x€C ' means that the individual 
denoted by 'x' is a member of the class denoted by 'C' (i.e.. 



* The work reported herein was supported by the Foundation 
Research Program of the Naval Postgraduate School with funds 
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that X has property C) . Thus ' Ar istotleGman ' would indicate that 
Aristotle is a man, and ' 26even ' would mean that 2 is an even 
number. (The symbol 'S' is an abbreviation for Scrtri , which is 
the Greek word for 'is'.) 

If 'x' and 'y' are names of individuals and 'R' is the name 
of a relation, then 'x R y' means that x bears the relation R to 
y. For example. 



Aristotle student Plato 

means that Aristotle is a student of Plato. Also, '2 < 3' means 
that 2 bears the less-than relation to 3, i.e., that 2<3. Where 
there is little chance of confusion, 'x R y' will be written 
'xRy' and ' xSP ' will be written 'xP'. The notation that we have 
introduced above will be extended to classes of classes, classes 
of relations, relations among classes, relations among relations, 
etc . 

2 . 2 relational descriptions 



If 'S(x)' is a sentence involving 'x', then a class descrip- 
tion is an expression of the form ’x(S(x))'. This denotes the 
class of all individuals, a, for which S(a) is true, i.e., 

a G X (S (x) ) 4-> S (a) 

Similarly, if 'S(x,y)' is a sentence involving 'x' and 'y', then 
'xy(S(x,y))' is a relation description which holds between a and 
b whenever S(a,b) is true, i.e., 

a [ xy(S(x,y)) ] b S(a,b) 

To illustrate this notation we will define the converse of a 
relation. 

2 . 3 converse 

The relation R~^ is called the converse of R, i.e. 

xR'^^y yRx . Using our notation for descriptions we can 

define. 



R ^ = x9(yRx) 

As an example of a relation among relations, we define ” ' " as the 
relation that holds between converses: 

r ' s r=s~^ 



Hence , 



f§ (r=s“^) 
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Some examples of converses are: 

parent"^ = child 

The following are easily proved properties of the converse: 



r ' s s ' r 

I 1 — I 



2. 4 arrow diagrams 

Relations can be portrayed by "arrow diagrams" (Haase 
diagrams). In such a diagram there is a node for each individual 
related by the relation and an arrow from x to y whenever xRy. 




represents the relation R such that 

bRa , cRb, dRb, eRd , eRe, bRe 

and ~xRy for all other cases. The effect of the converse opera- 
tor is to reverse all of the arrows. Hence, R”"^ is diagrammed: 




2.5 tables 



Relations can often be viewed as tables. For instance, the 
relation R of the previous section can be shown as a table: 

R 




Of course, it makes no difference in what order we write the rows 
of the table. 
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The converse of a relation is obtained by simply exchanging 
the columns of the table; 




Of course, classes are represented by one column tables. For 
instance the class C of primes less than ten is: 

C 

2 

3 

5 

7 



3. Domains 



We often need to talk of the individuals that can occur on 
the right or left of a relation. We say that x is a left - member 
of R whenever there is a y such that xRy. 

X Lm R <-> 3-y(xRy) 

For instance, if 'x parent y' means that x is a parent of y, then 
'Socrates Lm parent' means that Socrates is a parent. Right - 
member and member are defined analogously: 

y Rm R «-> 3x(xRy) 
z Mm R •4-> z Lm R V z Rm R 

These satisfy the identities: 

X Lm R <-> X Rm R~^ 
y Rm R y Lm R“^ 



4. Functions 



4 . 1 basic concepts 

Functions and relations are closely related. Consider the 
predecessor relation, 'pred': 

X pred y x = y-1 
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Thus, X pred y says that x is the predecessor of y. The 
corresponding arrow diagram is: 

1 2 3 4 5 

* >* — > > > 

and the corresponding table is: 




since 1 pred 2, 2 pred 3, etc. Notice that, in this case, for 
each right member x there is a unique left member y such that 
y pred x. This y can be written using Whitehead and Russell's 
[16] definite description : 



7 y (y pred x) 

This can be read: the y such that y is a predecessor of x. A 
more convenient way to write this is: 

pr ed : X 

In general, R:x means "the unique y such that y R x, i.e. 

R:x = 7y(yRx) 

This notation is meaningful only if there is a unique y such that 
yRx , i.e. 



yRx A zRx y=z 

That is, there is only one arrow leading to x. When this condi- 
tion is satisfied for all x we call R left univalent , symbolized 
by ' lun ' : 



RGlun Vxyz[ yRx A zRx y=z ] 

The left univalent relations are more commonly called functions . 
In a left univalent relation there is exactly one arrow leading 
to each node. Consider the "absolute reciprocal" relation: 

xRy X = |l/y| 



This is diagrammed: 



1 


1 


1 


-1 


1/2 


2 


1/2 


-2 


1/3 


3 


1/3 


-3 


• 


: 




Since RSlun it is meaningful to write R:x, so 
1/3. We can find R:x by following back the 
or by looking down the right column for 
corresponding element from the left column. 



we observe R:(-3) = 
arrow pointing to x 
X and taking the 



The concepts of 
defined analogously: 



right univalence and bi-univalence are 



RGrun Vxyz[ xRy A xRz y=z ] 

RGbun RGlun A RSrun 



Bi-univalent relations are also called bijections and one-one 
mappings . 



4 . 2 higher level functions 

Of course, the converse of a function is not necessarily a 
function. The 'sin' relation, defined so that y sin x means that 
y is the sine of x, is left univalent but not right univalent. 
Hence, we can write either y=sin:x or y sin x, but can express 
the arcsine only by: 

X sin~^ y 

The notation sin~^:y is meaningless. Since f:x is meaningful 
only when fSlun we will be careful to write f:x only when we have 
previously shown (or it is obvious) that fSlun and x Rm f. 



The fact that f:x may be meaningless makes it convenient to 
use several other relations derived from f. One of these is the 
plural description . If F is any relation and C is a class then 
Fl :C is the set of all y such that yFx for some x in C, i.e., 

F! = 2fi{z = y[3-x(yFx A xSC) ] } 

The tabular interpretation of F!:C is simple: 



II 

IE 



xl 



xn 



C F ! : C 



xl 




ii 


x2 




y2 


xn 




yn 



F 
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We see that, if F is any function, then F.':S is the image of the 
class S under that function. Notice that the operation F!:S is 
defined for all relations F and classes S, regardless of whether 
FGlun or the members of S are right members of F. For these rea- 
sons, it is generally safer to write F! :C than Fix- 

Related ideas are the image and_jconve rse image of an indivi- 
dual. If R is a relation, then c R x means that c is the class 
of individuals related to x. This class is called the referents 
of X, and is defined: 



R:x = y(yRx) 

The converse idea is that of the relate of y: 

R:y = x(yRx) 

Like the plural description, R and R are defined for all R and 
all arguments. 

Consider now the function 



=:x = 9(y=x) 

Hence, =:y is just the unit class containing y. Russell and 
Whitehead [16] write thTs Try. Conversely, if C is a single ele- 
ment class, then (=~^):C selects the unique member of that class; 

(=-l);C = 7x(x€C) 

It is thus a uniqueness filter. We will write this as 9;C: 



The expression 9;C can be read "the C" . 

We will occasionally need ^o refer to the relat£^ons that 
hold between R and R or R and R, which we write - and -, respec- 
tively; 

R ^ S R = t 

R - S <-> R = S 

The following are some properties of these operations; 




= R! : ( = :y) 

t = -:R 

■r = -;R 

It is often convenient to have names for domain extracting func- 
tions, e.g., lem:R is the class of left members of R. These are 
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simply defined 


using 


images : 








lem 


= Lm 






r im 


= ^ 






mem 


= Mm 


Of course the 


r ight 


and left 


members of a relation 



obtained by taking its right and left columns, respectively, and 
deleting duplicates. 






lem:R 



R 






r ira : R 



5 . Boolean Operations 



5. 1 logical connectives 



We will next investigate ways of combining relations and 
classes. The simplest methods are just abstractions of the logi- 



cal connectives 


used 


between propositions: Therefore, 


we define 


the intersection, 
relations : 


union, negation and difference of 


classes and 


x(S A T) 




xS A xT 




X (R A S)y 




xRy A xSy 




x(S V T) 




xS V xT 




X (R V S)y 




xRy V xSy 





x(-S) -(xS) 

x(-R)y -(xRy) 

X (S-T) xS A - (xT) 

x(R-S)y xRy A -(xSy) 

x(S-»T) xS xT 

x(R-^S)y xRy xSy 

As an example of the use of these operations, consider our previ- 
ous definition of Mm: 



z Mm R z Lm R V z Rm R 
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Using the union operation this can be written: 

Mm = Lm V Rm 



Similarly, 



bun = lun A run 

The logical connectives satisfy the usual properties of a Boolean 
algebra (e.g., DeMorgan's theorem). 

As an example of the use of these operations, we will define 
the closed interval function, m..n, which is the set of integers 
m, m+1, ..., n. It is just: 

m..n = >^:mA<Jn 

where ^ and < are the relations on integers. 

5 . 2 empty classes and relations 



It is useful to have names for the empty class and relation: 

$ = X(X7>^X) 

§ = x9(x7^x) 

Hence, x$ is always false, as is x§y. These are most often used 
for stating properties of relations and classes. For instance, 

SAT = $ 

means that classes S and T have no members in common. 

The universal classes and relations are also useful: 

I = -$ 

© = -@ 

For instance, 

S V T = 1 

means that every individual is either a member of S or of T. 
Notice that the class of the right members of a relation is just 
the image of the universe under that relation, i.e., 

rim:R = R!:$ _ 

lem;R = (R“^)I:$ 

mem:R = (R V R“^) ! :$ 



10 



5 . 3 Cartesian product 

It is often useful to have the maximum relation that can 
hold between two classes, i.e., the Car tesi an product of those 
classes. This is defined: 



S3JT = xy(xS A yT) 



The Cartesian product satisfies the following properties: 



(slt)*^ = tis 

lem : (sijt) = s 
r im : (silt) = t 
mem : (s3^t) = s V t 



sst(tAu) = (s^t) A (s^u) 

s|(t V u) = (s*t) V (Situ) 

sit(t-u) = (sitt) A (sit-u) 
s$(t^u) = (sl-t) V (Situ) 



@ = $ it'i 

sit$ = i|its_ = ijit'i ~ © 

sit = (Sit$) A (istt) 



5.4 subset relation 



Finally, we define the subclass and subrelation operations: 

SCT Vx(xS -» xT) 

RCS <-> Vxy(xRy — > xSy) 

The following are true: 

sCt — > (situ) C (titu) 

set -> (rls)C(ritt) 

set A uCv (situ)C(tJv) 

rSRel rC@ 

rGRel -> @Cr 

sGCls — > sC| 

sGCls —> $Cs 

where Cls is the class of all classes and Rel is the class of all 
relations (we are ignoring typing here). These can be defined: 

Rel = £:§ = £: © 

Cls = C:i| = C:$ 



6 . Limiting and Restriction 

It is often useful to limit the left or right domain of a 
relation. Consider the relation x sin“^ y, which means that x is 
an arcsine of y. We cannot write x = sin“^:y because sin”^ is 
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not left univalent (i.e. it is not a function). If we restrict 
y, the argument of sin, to the range -ir/4 to ir/4, then there is a 
unique x such that x sin“^ y. Let S be the class of reals in the 
range -ir/4 to »r/4 : 

xS {-?r/4_<x) A {x£!t/4) 

then we will write 

sin >S 

for the sine function with its arguments restricted to S. This 
function is bi-univalent, so it is invertible. If we call the 
inverse of this restricted sine Arcsin: 

Arcsin = (sin>S)~^ 

then it is perfectly meaningful to write Arcsintx (if x Lm sin). 
The right-restriction operation is defined: 

x(R>S)y xRy A yS 

The left-restriction is defined analogously: 

x(S<R)y xS A xRy 

These notations can be combined to restrict both domains: 

X (S<R>T) y xS A xRy A yT 

The combination s<R>s is so common that a special notation is 
provided for it: 

R5s = s<R>s 

For instance, <5P, where xP x>0, is the less-than relation 

restricted to positive numbers. 

The restriction operations are easily defined in terms of 
intersection and Cartesian product: 

s<r>t = r A (sift) 
r8s = r A (sits) 
s-tr = r A (siti) 
r>s = r A (ii(Es) 



12 



C 

x2 



xn 



R 



: 


i 


xl 


yi 


: 




x2 




: 


: 


xn 


yn 


: 







tR 


xi 


yi 


x2 


y2 


: 




xn 


yn 



Other properties satisfied by these operations are: 



s^t = s<t§>t 


lem: (s<r) = 


s A lem : r 


r i m : ( r > s ) = 


s A r im : r 


lem : ( r>s) = 


r ! ;s 


r im : (s<r ) = 


r ^ ! : s 


(s<t:r)~^ = 

(s<tr>t)"^ = 

(rSs)"-^ = 


(r‘^)>s 
t< (r"l)>s 
(r-l)Ss 


II It 




r>s A r>t = 


r>(s A t) 


r>s_V r>t = 


r>(s V t) 


(rJ^$)>s = 


ris 



7. Relative Product 



If xRy is the relation "x is a son of y" and xSy is the 
relation "x is a brother of y" , then the relative product , R|S, 
is the relation "x is a son of a brother of y." More formally, 

R|S = xi(3y (xRy A ySz) } 

Where there is little chance of confusion, we will write RS for 
R|S. If f and g are functions it is easy to see that f|g is the 
composition of these functions: 

X = fg : z 

X fg z 

3-y(x f yAy g z) 

3-y (x=f :y A y=g:z) 

X = f : (g : z ) 

Hence, fg:x = f;(g:x). 

It is convenient to have a notation for relative products of 
a relation with itself. For instance, the "grandparent" relation 
can be written parent I parent , which we abbreviate parent^. In 
general , 
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lO = 

= 

^n + 1 _ 

)-n 



(mem : R) 

R 

{R*^)R = R(R'^) 

(Rfi)-l 



ime obvious properties of the relative product are 



(rs) t = r (St) 
r(sVt) = rsVrt 
(rVs)t = rtVst 
r (s A t) C rs A rt 
(r A s) t C rt A St 
3-{rs) 4-> 3(rim:r A lem;s) 



= r 

(rs)"-^ - 






,(S (r 1) 



= r 



m+n 



(r 



mx n 



= r 



mn 



r r ^ 


H r 
= r 


-Ir = 


lem : 


r s C 


lem : r 


r im : 


r s C 


r im IS 


Lm 


= Rm ' 




Rm 


= Lm’ 




r© 


= ©r 


= © 


rl 


= Ir 


= r 



(m , n_>0 ) 
(m , n^O , 
(rS^un) 



or rSbun) 
( r Gbun ) 



where I = x^(x=y) 



Structures 



We have previously seen the use of arrow diagrams to 




■presents the relation R: 
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R 



a 


g 


b 


f 


c 


e 


d 


d 


d 


e 


e 


i 


f 


f 


f 


i 


g 


f 


g 


h 



8.1 initial and terminal members 



Now, notice that the left and right members of R are: 

lem:R = {a, b, c, d, e, f, g} 
rim.'R = { g, f, e, d, i, h } 

We define the initial members of R to be those members which are 
not pointed at by an arrow. Therefore, the initial members of R 
are the left members that are not right members. 

. . 

initrR = (Lm-Rm):R = {a, b, c} 



The terminal members of a relation are defined analogously: 

term:R = (Rm-Lm) ; R = {h, i} 

When a relation is used to represent a data structure, the above 
functions become important. 

For instance, a sequence is represented by a relation with 
the structure: 



S 



^1 



a2 




« O 0 





In this case init:S is the unit class containing the head (first 
element) of the relation (i.e., a^) and term:S is the unit class 
containing the last element of the sequence (i.e., a^j) . Simi- 
larly, S>(-init:S) is the sequence with its first element 
deleted : 

^2 ^3 ^n-1 ^n 
>- ■ ■■ > » * . » $»• 

Hence, the following common sequence manipulation functions can 
be defined (represented by lower and upper case alphas and ome- 
gas) : 
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oc: S 


= 9 init; S 


"first" 


uj; S 


= 9 term: S 


"last" 


n:S 


= S> (-init : S) 


"final" 


A:S 


= (-term:S)<tS 


"initial 



following properties of these relations are easy to show; 

oc = ui' 

LU = oc' 

A' = 'Q. 

a* = ‘A 



e operations on sequences are discussed in the next section. 

As another example of the use of 'init' and 'term', consider 
elation representing a tree: 

a 




n, 9 init; T is 'a', the root of the tree, and term:T is {d, 
i, f, j, k}, the leaves of the tree. The result is analogous 
forests. Given 



a. i 1 




set of roots is init;F and the set of leaves is term;F; 
init:F = {a,i,g} 

term:F = {c , e , f ,g ,h , j , k , 1 ,m ,n , t , u , v ,w} 
higher level operations 

The set of nodes whose parent is n is just F”^:n. For 
tance, the set of nodes directly descended from a root is 

F”^! : (init;F) = { b ,h , j ,o , p , r } 

set of nodes that point to leaves is 

F! ; {term;F) = { b ,d , a , i ,o , p , s , r } 
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These operations can be used for obtaining the maximum and 
minimum of sets. Suppose '<' is the less-than relation on 
integers and S is some set of integers, say {3,5,9}. Then 

<XS = < > * 

3 5 9 

Now note that 

init:(<??S) = {3} 

te rm : ( <XS ) = { 9 } 

Hence, if S is any set of numbers, then the minimum and maximum 
of this set are: 

min : S = oc: (<XS ) 
max : S = ua: ( <XS ) 

These operations are only defined if S has two or more elements, 
since an irreflexive relation cannot relate less than two ele- 
ments. That is, an irreflexive relation when restricted to a 
unit or empty class becomes the empty relation. Notice that we 
can select the maximum and minimum based on any relation that is 
a series (i.e., transitive, irreflexive and connected). If R is 
any series then oc: (R5S) is the minimum (relative to R) and 
uj:(R 3(S) is the maximum. 

The following are simple properties of these operations: 

init : r = term : ( r~^ ) 
term : r = ini t : ( r“^ ) 
init C lem 
term C rim 

init:(r8s) = term:(r“^Xs) 

init:(r Vs) C init:r V init:s 
init:r A init:s C init:(r A s) 
term: (r V s) £ term:r V term:s 
term:r A term:s C term: (r A s) 
in i t : (s$t) = s-t 

term:(s3tt) = ini t : (s|t) = init:(t3ts) = t-s 



9 . Sequences 



9 . 1 ordinal couples 



In this section we will continue the discussion of sequences 
begun in the last section. We saw that it was easy to define the 
following operations on sequences: 
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oc: S = © initrS 

ua:S = 9 term:S 

n:S = (-init:S)<S 
A:S = S>(-term:S) 



This provides us with functions for taking sequences apart. We 
will define the ordinal couple or pair , which puts them together. 
If X and y are two objects, then 'x,y' is the relation that 
relates x and y but no other objects. 

(x,y) = •- > 

X y 

That is, u(x,y)v if and only if u=x and y=v. This is formally 
defined by; 

x,y = uv(u=x A v=y) 

This notation will be taken to be right associative, i.e., 

x,y,2 = x,(y,z) 

Observe that 

oc: (x,y) = X 

uj;(x,y) = y 

It will occasionally be convenient to write ordinal couples in a 
vertical format: 



(p) = (x,y) 



This notation is extended for relations of more than one pair: 




The class of all the ordinal couples (or pairs) that can be 
made from the classes S and T is: 

SXT = ^Qxy[xSS A y€T A P= (x,y) ]) 

There is obviously a close relation between sXt and siSt. Later 
we will say that sjt is a Currying of sXt. Note that 

(x,y)GSXT X [S^T] y 

We will define a convenient notation for sequences of two or 
more elements: 

< x^, X2/-.W x^> = (X2,X2) V (x2fX3) V ... V (Xp,_3,Xp,) 

Therefore the sequence <a,b,c,d,e> is just 
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Also, note that. 



• ^ — 3»* 

abode 












^2 


^3 


: 


: 


^n-l 


^n 



9 . 2 catenation and consing 

If s and t are sequences then we can define an operation 
's'*t', which is the catenation of s and t. To form this catena- 
tion we must hook the last element of s to the first element of 
t : 




Therefore x [s"*t] y if and only if x s y, or x t y, or x=uj:s and 
y^:t. Hence, 



s^t = s V (ua:s, oc:t) V t 



The catenation operation is only defined for sequences, 
which are required to have at least two elements (since an irre- 
flexive relation with less than two elements is the empty rela- 
tion) . Note that we can extend the definition of sequences so as 
to allow length one sequences by making the relation reflexive. 

s V (= X mem:s) 

Q Q Q D 

Si S2 S3 Sj^ 

The one element sequence is then: 

Q 

Sq = (Sq' 

The full rami f ications of this definition of sequence have not 
been investigated. 



How do we add a single element to the left or right of a 
sequence? The "cons left" and "cons right" operations are easy 
to define: 



X 



cl 



^1 





s 



n 
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X cl s = (x, oc:s) V s 
s cr y = s V (ui;s, y) 

It is easy to show that if s is a sequence, then: 

oc; (x cl s) = X 
D.; (x cl s) = s 

uj: (s cr y) = y 
A: (s cry) = s 

(oc:s) cl(Q.:s) = s, if)t:s>2 

(A:s) cr (ui:s) = s, if #:s > 2 



Also, if s is a sequence, then s V (ui:s, oc:s) is a ring formed by 
joining the last element of s to the first element. 



If s is a sequence, then s ^ is the reverse of s. Hence, 
rev:s = s“^ 



oc:s = uj: s 
uj:s = oc:s~^ 

A:s = (a:s~|-)~|- 

H: s = (A:s ■*•) 

(s"t)~^ = t~^"s~^ 

(x cl s)~-^ = s“-*- cr 

(scrx)~^ = xcls 



(x,y)~^ = (y,x) 

< Xj^, X2^..., = < Xj^,..., y. 2> ^i> 

f^^l "^2 ••• ^ fy^ V2 ... y^^ 

\^l y 2 ••• Ynj " V>^1 X 2 ... xj 

If S is a sequence and x Mm S, then S~^:x is the successor 
of X in S and S:x is the predecessor of x in S (if these exist) . 

S~^:x = successor of x in S 
S:x = predecessor of x in S 



These are convenient ways of moving around within a sequence. 
Also, note that if s is a subsequence of t then sCt. 



Some additional identities are: 

(?)a) = (^) 

oc! : (SXT) = S 
UJ! : (SXT) = T 
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Finally, we will state the formal definition of a sequence: 
a relation is a sequence if it is a connected irreflexive bijec- 
tion. That is, 

sequence = connex A irrefl A bun 

sGirrefl s° C s~^ 

sGconnex lem:s = init:s V rim:s 

A rim:s = term:s V lem;s 



10. Binary Operations 



10.1 basic concepts 



In this section we will discuss our approach to binary 
operations - that is, to functions with two arguments and one 
result. We have already seen how unary functions are connected 
to relations. For instance, we can write the fact that y is the 
sine of x by either: 



y sin X 
or 

y = sin ;x 

Since we only deal with binary relations, we will have to have a 
new convention for handling binary functions. This convention 
is: we will combine the two arguments of an operation into a 

pair. For instance, we can define a relation 'sum' such that 

X sum (y,z) 

if and only if x is the sum of y and z. More formally: 

sum = xa(a=(y,z) A x=y+z) 

We can use our colon convention as usual, e.g., 

X = sum:(y,z) x sum (y,z) 

Now, it would be inconvenient to have to invent names, such as 
'sum', for each operation, such as '+'. Hence, we will adopt a 
systematic convention for making such names: either placing the 
conventional infix symbol for the operation in parentheses or 
underlining the symbol. For instance, 

x+(y,z) X = +:(y,z) x = y+z 

In fact, if ir is any infix operation symbol, we will explicitly 
define its meaning by 
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xiry = it: (x,y) 

This notation will permit us to manipulate in a more regular 
fashion the usual arithmetic operations (+, *, /) as well as 

the relational operations (e.g. At V, X/ </ >» 3, For 

instance, if S is a class of classes, then 

(A) ! :SXS 

is the class of all pairwise intersections of members of S. 

10.2 operations on binary operations 

It is often convenient to be able to generate simple rela- 
tions from a binary operation. Following Russell and Whitehead, 
let IT represent any binary operation. We define: 

irz = x9(x = yiTz) 
yir = x2(x = yirz) 

Hence , 

x(-l)y X = y-1 

therefore (-1) is the predecessor relation. Similarly, 

X ( 1+) y X = 1+y 

therefore (1+) (or (+1)) is the successor relation. These can be 
used as functions: 

( -1 ) : X = x-1 

( +1 ) : X = x+1 

This convention makes it very easy to form more complex func- 
tions. For instance, if we want 

f :x = sin : (1/x) 

then we can define 

f = si n ( 1/) 

To see that this works: 

f :x = (sin (1/) ] :x 

= sin ; ( (1/) : x ] 

= sin : ( 1/x ] 

Now observe the action of the (x,) and (,y) functions: 

(X, ) :y = (x,y) 

( ,y) :x = (x,y) 
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Therefore, for any binary operation it (except we can define 



Let's see why this works: 

(yir) :z = [jr (y,) ] :z 
= jr: [ (y, ) :z] 

= ir: [y ,z] 

= yirz 

(n'z):y is analogous. In general, if f is a binary function, then 
f(x,) and f(,y) are the "partially instantiated" unary functions. 
This is the effect of Curry and Feys "B" combinator [5]. 

Since is the reverse of a sequence, ?r I ' is the reverse 

form of an operation. For instance, reverse subtract 

operation : 



Thus can be read "subtract from" and /' can be read "divide 
into". This is Curry and Feys "C" combinator (see the next sec- 
tion) . 



11 . Combinators 

In this section we will discuss several powerful operations 
for manipulating relations. These are called combinators because 
of their similarity to the combinators of Curry and Feys [5]. 

The first combinator we will discuss is the paralleling of 
relations, which is defined: 



TTZ 

yrr 



1 ( /Z) 

£(y f ) 



(x,y) 



: (x,y) ) 
(y,x) 



y-x 




So, if f and g are functions 





Hence, X is the element-wise combination of f and g. 
pie, if^we want f:(x,y) = sin:x + sin:y, we can write 



For exam- 



f 




since 
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f : (x,y) 



(t):( (J)) 

(*)= 

sin :x + cos :y 



One of the simplest combinators described by Curry and Feys 
is the elementary cancellator , K, defined so that K:x is a func- 
tion such that (K:x):y = x for all y. That is, K generates con- 
stant functions. Since K;x is a relation that relates x to 
everything, we can define it: 

K = (l|)i 

where i = 9“^ is the unit class generator. To see that this 
works, note that 



K;x = (i^$)i:x = (i:x)|(| 



and therefore that 

u(K;x)v u[(i:x)?t$]v 

«-> uGi:x A vGf «-> u=x 

Therefore, (K:x):v = x. 

Another combinator is the elementary dupl ica to r , W, defined 
so that 



(W:f ) : X = f : (x,x) 



If we define = (x,x) then it is easy to see that W:f is just 

fA» f’or instance, (*)A is the squaring function; 

(*)A:n = (*) : A:n) 

= (*) : (n ,n) = n*n = x\^ 



It should be clear 
since 



that Backus' [f,g] combining form is just 




our 



Since this combination is so common we will adopt a special nota- 
tion for it; 



Hence , 





f 

7 



A 



Some of the properties satisfied by these combinators are: 
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R T _ RT 
S IT " '^TT 

fR\ n ^ R*^ 

" gtl 

g)T = I? 

R T| _ RTI 

-ol “ 

^ ^ 

R I _ S _ I R 

-S' “ T? ■ -S' 

■ n = ^ 

oc ^ = R} (Rm : S ) 

oi = S> (^:R) 

R ^ Fbc 
Slu 

cl = f|-l 

As an example of these combinators it is easy to show that 

( = 

is the function f:t = t^+2t. 

Another combinator is the meta-application operator, 
which corresponds to Curry and Feys' S combinator: 

(f;;g):x = (f;x):(g:x) 

For instance, [(l)']::init is the operation that gives the set of 
descendents of roots of a forest, F, since 

([ (1 ) ' ] : :init) :F = (F"^ I ) ; ( ini t : F) . 

The formal i zing combinator , $, is defined so that 

($:(f,a,b)):x = f:(a:x,b:x) 

It is easy to see that 



$ : (f ,a ,b) = f§ 



For instance. 



f 



!:(( + ), (*)A/ 2*) 
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is just the function f:x = x +2x. This can be written more 
clearly using the notation of our relational calculus: 

f = 

Another combinator defined by Curry and Feys is the combi- 
nato r : 



[*i: (f / 9 ) ] : (x,y) = f:(g:x,g:y) 

This is simply defined by 

•p: (f ,g) = f| 

so that 
Therefore, if 

f = *i:(( + ), (*)Z^ 

then f:x = x^+y^. 

One final operation we wish to define in this section is 
"Currying". This relates a relation to the correponding class of 
pairs. If S is a class of pairs, then Curry:S, the Currying of 
S, is the relation R such that xRy if and only if (x,y)SS. For- 
mally, 



Curry:S = x9[(x,y)SS] 

The inverse operation, Curry“^:R, is also useful. 

Some properties satisfied by these combinators are: 

Curry: (SXT) = S]fT 

(K:x)f = K:x 

f(K:x) = K:(f:x) = Kf:x 

AA = I 



12. Ancestral Relations 



12.1 definition 

Carnap [2] defines the relation of a property p being 
Hereditary with respect to a relation r: 

p Her r Vxy{x6p A x r y — > y€p} 

r~^ ! :p C p 

This leads to the definition of the ancestral of R of the first 
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kind as that relation which preserves all the hereditary proper- 
ties of R. This is also called the transitive closure of R: 

X R* y X Mm R A ’^p[p Her R A x€p — > ySp] 

For example, if xPy means that x is a parent of y, then xP*y 
means that x is an ancestor (or the same as) y. The ancestral of 
the second kind is also useful: 

R''' = R*|R 

Thus, P"** means "ancestor" in the colloquial sense. The easiest 
way to visualize the meanings of the ancestrals is by their 
expansion as infinite unions: 

R* = R° V V r2 V R^ V . . . 

R''' = r1 V r2 V R^ V R'^ V . . . 



Here are some useful properties of the ancestrals: 

R"^ = R*-( = ) = R*-r‘^ 

xR*y 3n[n>0 A xR'^y] 

R° C R* 

R" C R*, for n>0 
C R+, for n>0 

RIR'*' = R"^ 

R"*" C R* 

* 

V R'*’ 



>+ ^ 



: T 



(R+) 1 = (R 1)+ 



(r?fs) 



* 

r 5{s 



Ancestral relations are always transitive. Notice that ^ and > 
for integers can be defined: 

> = (1 + )* 

> = (1 + )'^ 

The ancestral "fills out" all of the paths in a structure. For 
instance, if 

R ~^1 ^2 ^3 ^4 

> » > » > 



then 
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2 a pplications 

Suppose that S is a sequence and we wish to find the first 
iber of S which satisfies some property P. First form the clo- 
e S"^, so that for any two members of S'*' we can tell which is 
■St. Next, eliminate from S"*" any members that do not satisfy 
Then, oc: (S‘'’XP) is the first member of S satisfying P. 

Next we will consider a simple character manipulation exam- 
i; ^tripping leading blanks from a string. Note that 
y cl) z means that x is a result of consing 0 or more y's on 
! front of z. Hence, 




z [ (y cl ) *] ^ X 

ins that z is the result of stripping one or more y's from the 
int of X. To get the desired result it is only necessary to 
itrict the left domain of this function to be sequences that 
I't begin with a y. Suppose Y is the property of beginning 
;h a y: 



xY <-> y= oc:x y oc x 4-> x G oc:y 

irefore, the function to strip leading y's from a sequence is: 

(-<x:y) < ( (y cl) 

Before we leave the topic of ancestral relations, it will be 
iful to investigate their use as a means of iteration. Suppose 
;t F is a function (i.e., left univalent). Then, since 

F* = f'^ V F^ V F^ V • . . 

will have yF*x if and only if for ^ome n, y = F'^tx. In gen- 

.1^ there may be many such n, so F may not be left univalent. 

F is to be a function, it is necessary to pick a termination 
di tion T (a cl^ss) that is only true for one of F^ : x , F-^ ; x , 

X, .... Then T-tF is just the function sought; it is roughly 

ivalent to 



while ~T ^ F 

logously, T<F''' is roughly equivalent to 
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repeat F until T 



13. Arrays 



13.1 definition and basic operations 



An array is just a function from a contiguous subset of the 
integers to some set of values. If A is an array _and i Rm A then 
A:i is the i-th element of A. Similarly, if I C Rm:A is a set of 
index values then A!:I is the corresponding set of array values 
and A>I is the subarray of A selected by those indices. 

It is easy to define multi-dimensional arrays; they are 
just arrays whose elements are selected by sequences of integers, 
e.g. M:(i,j). If M is a two-dimensional array, then M(i,) is the 
i-th row of M and M(,j) is the j-th column of M. Also, if I is a 
set of row indices and J is a set of column indices then M>(iyj) 
is the submatrix of M selected by these sets. It is easy to see 
that M' is the transpose of M, since 

M':(i,j) = M:(':(i,j)) = M:(j,i) 

More generally, if P is a permutation function (i.e. a bijection 
from an index set into itself) then AP is the result of permuting 
A by P. 



Suppose X i£ an element of the array A (i.e., for some i, 
x=A;i). Then A:x is the set of all indices for which x=A:i. 
Therefore we can find the index of the first occurence of x in A 
(i.e. APL's iota operator) by min'Xix. In general, if P is some 
property (i.e. class), then A”^!:P is the set of indices of all 
elements of A that satisfy P. A sorted reflexive sequence of 
these indices is just < 5 (A“^!:P) 

13.2 relation to sequences 



It is easy to convert arrays to sequences and vice versa . 
Suppose all the elements of A are distinct, then A~^^ Ts a func- 
tion that returns the index of an element of A. We want to 
define a sequence S such that xSy if and only if x preceeds y in 
A, i.e. the index of x is one less than the index of y. 

xSy «-> (A“]-;x) = (A“^;y)-1 

•€-> (A”^:x) (~1) (A~^:y) 

x[A| (-1) |A"^]y 

Hence, S = A(-1)A“^. 

Next, we will consider the opposite process: converting a 

sequence to an array. Suppose we have a sequence: 
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S 



^0 



>• **- 



^3 



We wish to convert this to an array: 

ao— ^0 
A = a^^* — 




Thus, for each element a^ in the sequence, we must find its index 
i in the resulting array. If we can define a relation R such 
that R:a^ = i then R~^ will be the array we seek. Now R:aj^ is just 
the number of predecessors of a in S. That is, Sq has no prede- 
cessors, so R:3 q = 0; a^ has two predecessors, so R:a 2 = 2, and 
so on. Since S defined an immediate predecessor relation. S'*’ 
defines an ancestral predecessor relation: 



S+ 




The set of predecessors of any element a is then S''’:a, e.g. 

S ’*" : 3 2 = ^1^ 

The size of this class is then the desired index: 

= 2 

Now, we know that A is R~^, 
so we can define the function saO which converts a sequence into 
a 0-origin array: 

saO:S = 

To produce a 1-origin array, the only alteration is: 

sa:S = 

13.3 other array operations 



#: (S + :a2) 



Hence, R:a = #: (S^:a) , so R = fS'*’. 



Next we will consider the concatenation of arrays. If A is 
an array such that A:i = a^^, then we can write A: 

A = ^ (a2'2) V ... V (a^,m) 

where m is the length of the array. Similarly, suppose that B is 
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an n element array, then the concatenation of these arrays is 

A cat B = (a 2 ^ 1 ) V- • • V ,m) V , m+1 ) V • ♦ • V (bj^ ,m+n) 

We can see that A cat B = AVB ' where B' results from B by shift 
its indices by m: 

B’ = (b;^,m+l) V ... V (bj^,m+n) 

How do we compute B'? Observe: 

xB ' i xB(i-m) xB[(-m):i] <-> xB(-m)i 

Hence, B' = B(-m) and A cat B = A V B(-m) , where m is the length 
of A. The length of A is just #rim:A, so 

A cat B = A V B(-#rim:A) 

We will finish our discussion of arrays by investigating the 
generation of sorted arrays. Let S be a set of integers to be 
sorted, then is a structure which relates lesser el ement s to 

greater elements. Now if x is any element of the se^, (<XS):x is 
the set of all el emen ts . 1 ess than x. Thus [ (_<XS ) ] : x is the 

number of elements of S less than or equal to x. This is just 
the index of x in the sorted array we seek. Hence if A i s^ the 
sorted array, xAi if and only if i [ # (_<XS^ ] x , so A = [ If ] ~^ . 

Of course this can be generalized to any ordering relation. 



14 . Scanning Structures 
14.1 basic concepts 



In this section we will discuss several methods for scanning 
structures , that is, for applying a function to each element of a 
structure and accumulating the results. Since no one method has 



yet 

work 


been selected, this section should be taken 
in progress. 


as a 


r epo r t 


of 


file 


A general paradigm for processing a structure 
, is the following: 


, such as 


a 


1. 


Perform some ini tial i za tion . 








2. 


Read the next (or first) element of the file. 








3. 


Take this value and the results of processing 
values . 


the 


previous 


4. 


Process these to yield new cumulative values 
f rom step ( 2 ) . 


and 


continue 
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5. When the end of the file is reached, return the accumulated 
result of processing all of its elements, 

A simple form of this appears in APL ' s reduction operation: 

+/V = V^+(... 

A more general form is Backus' insert: 

/f : <x^ , . . . , Xj^> = f:<X2f*.. f : <Xj^_^ , x^^> • . . > 

Our first example of scanning structures will be to express this 
operation in the relational calculus. 

14.2 reduction of arrays 

We are given an n element array A and wish to compute: 
t = A:n + A:(n-1) + ... + A;2 + A:1 



where we have assumed that the right members of^A are l..n. We 
saw in the section on ancestrals that T<F will iterate the 
application of F with T used as the termination condition. Con- 
sider how the analogous loop would be written in Pascal: 



S : = 0 ; i : = 0 ; 
while ij^n+l do 

begin S :=S+A[i]; i := i+1 end 



On each iteration two functions are performed: S is incremented 
by A[i] and i is incremented by 1. Let's represent the state of 
the computation by a pair (s,i), where s is the cumulative sum so 
far and i is the index of the next element to process. We will 
use F to represent one processing step, so that, if (s',i') is 
the new state, we can solve for F as follows: 



F 





( 



s+A : i\ 
i + 1 ) 



Hence, F 




/ ( + )J: (s,i)\ 

\^( + l)Lu: (s,i)y 



{+r)'uj 




<+)ff 

{ + 1 ) Uj * 
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It remains to determine the termination condition, T. If x 
is a state, i.e., a pair (s,i), then xST when i=n+l. Hence, x€T 
when ui: X = n+1, so 

xGT Ui:x = n+1 

«-> (n+1 ) in X 

X S tu: (n+1 ) 

Hen^e, T = ai;(n+l). The final state, Xg, containing the sum is 
T<F : x^, where Xj^=(0,l) is the initial state; 

Xf = (T<F*):(0,1) 

Now, the total t is just octx^, so 

t = oc(T<F*) : (0, 1) 

We can generalize this to any function f with initial value i; 
t = oc(T<F*) : (i,l) 

^ = r + TJuT 

This result can be improved by directly extracting the 
result from the fi^al state. That is, we want to define a filter 
such that t = :(i,l). Hence we want ts^x^, so 

t (t,n + l) 

Now, note that [,n+l]:t = (t,n+l), so 

(t,n+l) [,n+l] t 

by the definition of Therefore ^ = [,n+l]~^ and we have the 

simplified formula 

t = (,n+l)"^F*: (i,l) 

14.3 reduction of sequences 

Next we will consider the scanning of sequences. Suppose S 
is a sequence: 



S — ^s^,S2f***,s^,EOF^ 

where EOF is an "end marker"; it can be any value. Now, we wish 
to find the result 



that is 



i f S]^ f S2 f 



f "n 
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for some function f and starting value i. The state can be 
represented by a pair (t,s), where t is the result so far com- 
puted and s is the rest of the sequence to be processed. Hence, 
(t',s') = F:(t,s) where t' = f:(t,oc:s) and s' = 0.: s . Therefore, 



Hence , 




(t/]c;s) 

a: 



) 



( (t,s)^ 

\Dui: (t,s)/ 




F 




nuT 



What is a terminal state? Notice that D.:<Sj^,E0F> 
terminal state will have the form (r,§). Hence, 



r = (,§)-1 f*; (i,S) 



©, so a 



To put this in a more useful form, we will define a function f0i 
such that r = (f0i) :S. This is simply 



f I 

fi^i — (r©) ^ rro ( ^ ' ) 

Then, the sum of the elements of a sequence S is just (+)0O:S. 
14.4 scanning general structures 



It is often useful to scan a structure while performing some 
processing at each node. When the data structure is a sequence 
this amounts to APL ' s reduce operator and Backus' insert opera- 
tor. We will define a scanning operation that works on a more 
general class of structures. This operator can be understood 
intuitively as follows; The state of the scanning process is 
represented by a set of "read heads" each of which is "positioned 
over" a node and holds state information accumulated from the 
nodes it has already visited. A node can be processed when a 
read head has moved to that node over each edge which leads into 
the node. When this occurs a processing function is applied to 
the node (as first parameter) and the union of the state informa- 
tion of each of the read heads (as second parameter) . The result 
of this processing step becomes the state information associated 
with a new set of read heads which are advanced along each edge 
leading out from the node. The processing of the structure is 
completed when all read heads have arrived at terminal nodes 
(hence this scanning operation is not defined for cyclic struc- 
tures) . Scanning a structure is started by positioning a read 
head with initial state information over each initial node. 



The scanning operation is symbolized by fli, where f is the 
processing function and i is the initial state for the read 
heads. For instance, if V is a vector, (+)I0:V will scan the 
elements of V using (+) (i.e. APL +/V or Backus' (/+):V). For a 
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more interesting example, suppose T is an attributed parse tree, 
E is a function that evaluates attributes and B is the initial 
set of attribute bindings. Then EIB:T propogates the values of 
inherited attributes down to the leaves of the tree. Conversely, 
EIB:(T~^) propogates the values of synthesized attributes back to 
the root. Hence, repeated applications of EJB and (EIB)’ will 
evaluate all of the attributes. Of course, this program will 
work just as well if T is a forest of parse trees. The I opera- 
tor is still undergoing evaluation as it is one of several possi- 
ble structure-directed scanning operations. 



1 5 . Examples 



In this section we will give several examples of relational 
programs . 

15.1 p ayroll 

Suppose we have a file $ of employee records, where r = <|:n 
is the record for the employee with the employee number n. We 
will suppose that employee records are functions defined so that: 

r:N = employee name 

r:H = hours worked so far this week 
r:R = pay rate 

We are given an update file U such that Urn is the number of 
hours worked by employee n today. We wish to generate a new pay- 
roll file $ ' . 

SOLUTION: Let r = $:n and r' = |':n be the old and new 
employee records. It is clear that r' is the same as r except 
for its H field. In order to modify part of a relation, we will 

use the Md function defined by: 

Md : (S,R) = R V S> (-^:R) 

Then, if h' represents the new value of the H field, the new 
employee record is 



r' = Md:{r, (h',H)) 

where h' is just the cumulative hours worked: 

h' = ($:n) :H + U:n 

Therefore, by the definition of 

$':n = r' = Md:(i|:n, (h',H)) 

To find we must factor out the employee number n. To do this, 
note that ($:n):H = (:H):($:n) = (:H)$:n. That is, (:H)f is a 
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slice of the payroll file: the hours worked for each employee. 
Therefo re , 



h' = ($:n):H + U:n = (:H)$:n + U:n 

= ( + ) :n 

Now, define the updating function u by 
u:n = ( (+)IlgIiJ :n, H ) 



(,H) (+)Iigil 



: n 



Then, $':n = Md:($:n,u;n) = Md^tn Therfore, the solution to 

our problem, the new payroll file, is 

= Md|J 

where u = ( , H) ( + ) ( • ^ 



15.2 check issueing 



Suppose we wish to take the payroll file from the previous 
example and generate checks for the employees. We will assume 
that a function C is available such that C:(nm,p) returns a check 
in the amount p made out to the name nm- 



SOLUTION: We will ignore overtime computations. Hence, if n 
is an employee number then $:n:N is his name and 



p:n = $:n:H * $:n:R 

is his pay. Hence, his check c:n is c:n = C:(nm,p:n) = C 



— = 17 ( * 

\ P:n y 



Nil. 



. / nm \ 

•(p:ny 



: n 



Combining these we have the file F mapping employee numbers into 
checks : 



F = cl 

* 

from which we can factor out the old payroll file: 




If we just want a set of checks, this is Lm:F. 




16. Implementation Notes 

The primary goal of our investigation has been to determine 
if relational programming is significantly better than 
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conventional methods. It would be premature to devote much 
effort to implementation studies before it is even determined if 
relational programming is an effective programming methodology. 
However/ a brief discussion of implementation possibilities is 
probably not out of line. 

The most obvious representation of a relation is the exten- 
sional representation, in which all the elements of a relation or 
class are explicitly represented in memory. There are many kinds 
of extensional representations, such as hash tables, binary trees 
and simple sorted tables. Of course, performance can be improved 
through the use of associative memories and active memories (in 
which each memory cell has a limited processing capability). 

Some relations and classes will be so large that it is 
uneconomical to represent them explicitly in memory. In these 
cases an intensional representation [11] should be used. Here a 
class or relation is represented by a formula or expression for 
computing that relation or class. Operations on the class or 
relation are implemented as formal operations on the expression. 
This is feasible because of the simple algebraic properties 
satisfied by relations. It can be seen that an intensional 
representation is really just a variant of a lazy evaluation 
mechanism [9, 10]. Sometimes an intensional representation is 
necessary; for instance, relations of infinite cardinality, such 
as the numerical operators and relations, require an intensional 
representation. 

Although the programmer could be allowed to choose between 
extensional and intensional representations for his relations, 
this is not necessary. It is probably feasible, and certainly 
higher level, to have the system choose representations on the 
basis of cardinality estimates of the classes and relations 
involved. The algebra of relations is regular enough that many 
of these decisions can be made at compile time. Any that can't 
can be deferred to run-time when exact cardinality information is 
available. See [14] for related techniques. 



17. Conclusions 



Of course, we are not the first to propose introducing 
aspects of a relational calculus into programming. Codd [4] has 
used a relational calculus as the basis for data base systems. 
Although he defines several operations on relations ( viz ., premu- 
tation, join, tie, composition, and restriction), this small set 
of operations is insufficient for general purpose programming. 
These remarks also apply to Childs' reconstituted definition of 
relations [3], which are also oriented towards data bases. Feld- 
man and Rovner [6] augmented Algol with several relational opera- 
tors for associative access to a data base. Their operations, 
which are our plural description and image, are quite limited, 
being based on a traditional von Neumann language. 
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One general purpose language that does make extensive use of 
sets and relations is SETL [7], which provides most of the fami- 
liar operations on sets (e.g., union, intersection, difference, 
powerset, image). SETL differs from relational programming in 
three significant respects: (1) it can only handle finite sets , 
(2) many operations must still be performed in a wo rd-a t-a-time 
fashion using the set former , and (3) it resorts to conventional 
tontrol structures. 

Finally, we must mention "logic programming" systems, such 
as PROLOG [15, 8], which use predicate logic to describe computa- 
tional processes. These systems also differ from relational pro- 
gramming in several significant respects: (1) they have a word- 
at-a-time programming style due to the use of variables 
representing individuals in the clauses of the program, and (2) 
they are implemented using a resolution theorem prover, whereas a 
nore conventional procedural implementation suffices for rela- 
tional programming. Essentially the same remarks apply to 
Popplestone ' s relational programming [13], which is like logic 
programming except that it uses "forward inference" rather than 
"backward inference". 

In summary, no other programming style that we are aware of 
:ombines the universal use of relations with a rich set of opera- 
tions on those relations that can be implemented in a determinis- 
tic, procedural way. It is hoped that the preceeding discussion 
las made plausible some of the advantages claimed for relational 
programming in the Introduction. Considerable work remains to be 
lone in evaluating the effectiveness of a relational calculus as 
3 programming tool. For instance, the optimum set of combinators 
and relational operators must be selected. Another non-trivial 
problem is the selection of a good notation for the relational 
:alculus. More from convenience than conviction we have used the 
notation of [16] and [2]. Making relational programming an 
affective tool will require designing a notation that combines 
readability with the manipulative advantages of a two-dimensional 
algebraic notation. This is all preliminary to any serious con- 
siderations of software or hardware implementation techniques. 
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